Selective compression for network connections

ABSTRACT

A system, apparatus, and method selectively provides content compression to a client based, in part, on whether the network connection from the client is determined to be a high latency, low-bandwidth connection. The present invention gathers one or more network metrics associated with the connection from the client. In one embodiment, the metrics include estimated TCP metrics, including smoothed round trip time, maximum segment size (MSS), and bandwidth delay product (BWDP). These estimated network metrics are employed to make an application layer decision of whether the client connection is a high latency, low-bandwidth connection. If it is, then content may be selectively compressed virtually on the fly for transfer over the network connection. In one embodiment, the selective compression uses a content encoding compression feature of the HTTP protocol standard.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Continuation patent application of U.S. patent application Ser. No. 10/957,024, filed on Oct. 1, 2004, entitled “Selective Compression For Network Connections,” the benefit of the earlier filing date of which is hereby claimed under 35 U.S.C. §120 and which is further incorporated herein by reference in its entirety.

TECHNICAL FIELD

This invention relates generally to network communications, and more particularly but not exclusively, to the compression of network communications.

BACKGROUND

According to some studies, the volume of information over a network, such as the Internet, is expected to more than triple over the next three years. Data and content is likely to remain the largest percentage of Internet traffic, with the majority of this information being dynamic. Often, the issues of concern with Internet traffic range from business to consumer response and order times, to the time required to deliver business information to a traveler using a wireless device, to the download time for rich media such as music, videos, and so forth. Thus, not surprisingly, a major complaint among Internet users is a lack of speed. Additionally, users' complaints often center on how long it takes to display a web page, or other content, on their computing device. One solution therefore, may be to send less data. This is where compression may help.

The idea is to compress data being sent from a server, and to have a client's browser decompress this data virtually on the fly, thereby reducing the amount of data sent over the Internet, and increasing a web page display speed. Many, although not all, browsers are now equipped to support the Hypertext Transfer Protocol (HTTP) standard that enables compression as a type of “content-encoding.” Content-encoding can often significantly reduce web page download times for highly compressible content, such as text, style sheets, XML, text document attachments, and HTML. The benefit is especially pronounced for clients communicating over low-bandwidth links. However, while static pages can be pre-compressed on a server, compressing dynamic web pages requires significant server resources, and often can not be pre-compressed. Thus, it is with respect to these considerations and others that the present invention has been made.

BRIEF DESCRIPTION OF THE DRAWINGS

Non-limiting and non-exhaustive embodiments of the invention are described with reference to the following drawings. In the drawings, like reference numerals refer to like parts throughout the various figures unless otherwise specified.

For a better understanding of the invention, reference will be made to the following Detailed Description of the Invention, which is to be read in association with the accompanying drawings, wherein:

FIG. 1 shows a functional block diagram illustrating one embodiment of an environment for practicing the invention;

FIG. 2 shows one embodiment of a server device that may be included in a system implementing the invention;

FIG. 3 shows one embodiment of a routing metric table useable in managing client connection metrics;

FIG. 4 illustrates a logical flow diagram generally showing one embodiment of a process for managing a communication with a client device using network metrics; and

FIG. 5 illustrates a logical flow diagram generally showing one embodiment of a process for determining client network metrics, in accordance with the invention.

DETAILED DESCRIPTION

The invention now will be described more fully hereinafter with reference to the accompanying drawings, which form a part hereof, and which show, by way of illustration, specific exemplary embodiments by which the invention may be practiced. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art. Among other things, the invention may be embodied as methods or devices. Accordingly, the invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. The following detailed description is, therefore, not to be taken in a limiting sense.

As used herein, the term “or” is to be considered to mean “and/or” unless the context clearly indicates otherwise.

Briefly stated, the invention is directed towards a system, apparatus, and method for selectively providing content compression to a client based, in part, on whether the network connection from the client is determined to be a high latency, low-bandwidth connection. Content compression may include any of a variety of compression mechanisms, including HTTP compression, and the like. The present invention gathers one or more network metrics associated with the connection from the client. In one embodiment, the network metrics include estimated TCP metrics, such as round trip time (RTT), maximum segment size (MSS), and bandwidth delay product (BWDP). These estimated network metrics are employed to make an application layer decision. For example, in one embodiment, the estimated network metrics are employed to determine whether the client network connection is a high latency, low-bandwidth connection. If it is, then content may be compressed for transfer over the network connection. In one embodiment, the compression mechanism uses a content encoding compression feature of the HTTP protocol standard. However, the invention is not limited to the above use. For example, the invention may also be employed to make traffic management decisions, to load balance high latency, low-bandwidth network connections towards a predefined server, manage data streaming, and for other application layer decisions. As used herein, application layer refers to layers 5 through 7 of the seven-layer protocol stack as defined by the ISO-OST (International Standards Organization-Open Systems Interconnection) framework.

Illustrative Operating Environment

FIG. 1 illustrates one embodiment of an environment in which the invention may operate. However, not all of these components may be required to practice the invention, and variations in the arrangement and type of the components may be made without departing from the spirit or scope of the invention.

As shown in the figure, system 100 includes client device 102, network 104, traffic management device (TMD) 106, and servers 108-109. Client device 102 is in communication with TMD 106 through network 104. TMD 106 is in further communication with servers 108-109. Although not shown, TMD 106 may be in communication with servers 108-109 through a network infrastructure that is similar to network 104.

Generally, client device 102 may include virtually any computing device capable of connecting to another computing device to send and receive information, including web requests for information from a server, and the like. The set of such devices may include devices that typically connect using a wired communications medium such as personal computers, multiprocessor systems, microprocessor-based or programmable consumer electronics, network PCs, and the like. The set of such devices may also include devices that typically connect using a wireless communications medium such as cell phones, smart phones, radio frequency (RF) devices, infrared (IR) devices, integrated devices combining one or more of the preceding devices, or virtually any mobile device. Similarly, client device 102 may be any device that is capable of connecting using a wired or wireless communication medium such as a PDA, POCKET PC, wearable computer, and any other device that is equipped to communicate over a wired and/or wireless communication medium.

Client device 102 may further include a client application that is configured to manage various actions. Moreover, client device 102 may also include a web browser application, that is configured to enable an end-user to interact with other devices and applications over network 104.

Client device 102 may communicate with network 104 employing a variety of network interfaces and associated communication protocols. Client device 102 may, for example, use various dial-up mechanisms with a Serial Line IP (SLIP) protocol, Point to Point Protocol (PPP), and the like. As such, client device 102 may transfer content at a low transfer rate, with potentially high latencies. For example, client device 102 may transfer data at about 14.4 to about 56 kbps, or potentially more. In another embodiment, client device 102 may employ a higher-speed cable, Digital Subscriber Line (DSL) modem, Integrated Services Digital Network (ISDN) interface, ISDN terminal adapter, and the like. As such, client device 102 may be considered to transfer data using a high bandwidth interface varying from about 32 kbps to over 622 Mbps, although such rates are highly variable, and may change with technology.

Network 104 is configured to couple client device 102, with other network devices, such as TMD 106. Network 104 is enabled to employ any form of computer readable media for communicating information from one electronic device to another. In one embodiment, network 104 is the Internet, and may include local area networks (LANs), wide area networks (WANs), direct connections, such as through a universal serial bus (USB) port, other forms of computer-readable media, or any combination thereof. On an interconnected set of LANs, including those based on differing architectures and protocols, a router may act as a link between LANs, to enable messages to be sent from one to another. Also, communication links within LANs typically include twisted wire pair or coaxial cable, while communication links between networks may utilize analog telephone lines, full or fractional dedicated digital lines including T1, T2, T3, and T4, Integrated Services Digital Networks (ISDNs), Digital Subscriber Lines (DSLs), wireless links including satellite links, or other communications links known to those skilled in the art.

Network 104 may further employ a plurality of wireless access technologies including, but not limited to, 2nd (2G), 3rd (3G) generation radio access for cellular systems, Wireless-LAN, Wireless Router (WR) mesh, and the like. Access technologies such as 2G, 3G, and future access networks may enable wide area coverage for network devices, such as client device 102, and the like, with various degrees of mobility. For example, network 104 may enable a radio connection through a radio network access such as Global System for Mobil communication (GSM), General Packet Radio Services (GPRS), Enhanced Data GSM Environment (EDGE), Wideband Code Division Multiple Access (WCDMA), and the like.

Furthermore, remote computers and other related electronic devices could be remotely connected to either LANs or WANs via a modem and temporary telephone link. In essence, network 104 includes any communication method by which information may travel between client device 102 and TMD 106.

Additionally, network 104 may include communication media that typically embodies computer-readable instructions, data structures, program modules, or other data in a modulated data signal such as a carrier wave, data signal, or other transport mechanism and includes any information delivery media. The terms “modulated data signal,” and “carrier-wave signal” includes a signal that has one or more of its characteristics set or changed in such a manner as to encode information, instructions, data, and the like, in the signal. By way of example, communication media includes wired media such as, but not limited to, twisted pair, coaxial cable, fiber optics, wave guides, and other wired media and wireless media such as, but not limited to, acoustic, RF, infrared, and other wireless media.

TMD 106 includes virtually any device that manages network traffic. Such devices include, for example, routers, proxies, firewalls, load balancers, cache devices, devices that perform network address translation, any combination of the preceding devices, and the like. TMD 106 may, for example, control the flow of data packets delivered to and forwarded from an array of servers, such as servers 108-109. TMD 106 may direct a request for a resource to a particular server based on network traffic, network topology, capacity of a server, content requested, and a host of other traffic distribution mechanisms. TMD 106 may receive data packets from and transmit data packets to the Internet, an intranet, or a local area network accessible through another network. TMD 106 may recognize packets that are part of the same communication, flow, and/or stream and may perform special processing on such packets, such as directing them to the same server so that state information is maintained. TMD 106 also may support a wide variety of network applications such as Web browsing, email, telephony, streaming multimedia and other traffic that is sent in packets. The BIG-IP Traffic Manager, by F5 Networks, is one example of a TMD. The 3-DNS Controller, by F5 Networks, is another example of a TMD.

TMD 106 may receive requests from client device 102. TMD 106 may select a server from servers 108-109 to forward the request. TMD 106 may employ any of a variety of criteria and mechanisms to select the server, including those mentioned above, load balancing mechanisms, and the like. TMD 106 may receive a response to the request and forward the response to client device 102.

TMD 106 may determine various network metrics associated with a network connection between client device 102 and itself. Based, in part, on the network metrics, TMD 106 may perform various application layer decisions. For example, in one embodiment, TMD 106 may compress content sent from servers 108-109 based on whether the network metrics indicate that the network connection is a high latency, low-bandwidth link. In another embodiment, TMD 106 may direct servers 108-109 to perform content compression based on latency or bandwidth metrics. In still another embodiment, TMD 106 may determine that the network connection is a high bandwidth link, and not perform compression. The decision not to compress the content may be directed, for example, towards improving efficiency of TMD 106, and/or servers 108-109. TMD 106 may further employ a process such as described below in conjunction with FIGS. 4-5 to selectively compress the content. TMD 106 or servers 108-109 may have two or more possible compression algorithms or level of compression to use. The compression algorithms or levels may vary in the length of time to execute or the amount of compression that results. In one embodiment, TMD selects one of the compression algorithms or level of compression based on latency, bandwidth, or other metrics described herein.

In another embodiment, TMD 106 may determine a network connection characteristic and, based on the network connection characteristic, select to forward communications to server 108 rather than to server 109. In still another embodiment, TMD 106 may also perform other application layer decisions based on various network connection characteristics, including streaming of content, and the like. In one embodiment, for example, multiple versions of websites may exist on one or more servers. One version of a website may be configured to include, for example, high-resolution images, and the like, while another website includes very few high-resolution images, and the like. TMD 106 may forward communications to the server or web site version having higher resolution images based on its determination that the network connection has low latency or high bandwidth.

In one embodiment, multiple servers may be geographically distributed from each other. TMD 106 may make a decision as to which server is best to respond to a request from client 102, based on whether the client 102 is connected to the network 104 with a high bandwidth connection. TMD may then either forward a communication to the selected server or cause the client request to be redirected to the selected server. HTTP redirection may be used to redirect the client request.

TMD 106 may be implemented using one or more personal computers, servers, multiprocessor systems, microprocessor-based or programmable consumer electronics, network PCs, radio frequency (RF) devices, infrared (IR) devices, integrated devices combining one or more of the preceding devices, and the like. Such devices may be implemented solely in hardware or in hardware and software. For example, such devices may include some application specific integrated circuits (ASICs) coupled to one or more microprocessors. The ASICs may be used to provide a high-speed switch fabric while the microprocessors may perform higher layer processing of packets. An embodiment of a network device that could be used as TMD 106 is network device 200 of FIG. 2, configured with appropriate software.

Servers 108-109 may include any computing device capable of communicating packets with client computer 102. Each packet may convey a piece of information. A packet may be sent for handshaking, i.e., to establish a connection or to acknowledge receipt of data. The packet may include information such as a request, a response, or the like. Generally, packets received by servers 108-109 will be formatted according to TCP/IP, but they could also be formatted using another transport protocol, such as SCTP, X.25, NetBEUI, IPX/SPX, token ring, similar IPv4/6 protocols, and the like. Moreover, the packets may be communicated between servers 108-109, TMD 106, and client device 102 employing HTTP, HTTPS, and the like.

In one embodiment, servers 108-109 are configured to operate as a website server. However, servers 108-109 are not limited to web servers, and may also operate a messaging server, a File Transfer Protocol (FTP) server, a database server, content server, and the like. Additionally, each of servers 108-109 may be configured to perform a different operation. Thus, for example, back-end server 108 may be configured as a messaging server, while back-end server 109 is configured as a database server. Moreover, while servers 108-109 may operate as other than a website, they may still be enabled to receive an HTTP communication.

Devices that may operate as servers 108-109 include personal computers desktop computers, multiprocessor systems, microprocessor-based or programmable consumer electronics, network PCs, servers, and the like.

It is further noted that terms such as client and server may refer to functions within a device. As such, virtually any device may be configured to operate as a client device, a server device, or even include both a client and a server function. Furthermore, where two or more peers are employed, any one of them may be designated as a client or as a server, and be configured to confirm to the teachings of the present invention.

Illustrative TMD Environment

FIG. 2 shows one embodiment of a network device, according to one embodiment of the invention. Network device 200 may include many more or less components than those shown. The components shown, however, are sufficient to disclose an illustrative embodiment for practicing the invention. Network device 200 may represent, for example, TMD 106 of FIG. 1.

Network device 200 includes processing unit 212, video display adapter 214, and a mass memory, all in communication with each other via bus 222. The mass memory generally includes RAM 216, ROM 232, and one or more permanent mass storage devices, such as hard disk drive 228, tape drive, optical drive, and/or floppy disk drive. The mass memory stores operating system 220 for controlling the operation of network device 200.

Operating system 220 may further include networking components 256, routing metrics' store 254, and selective compression manager (SCM) 252. Routing metrics' store is described in more detail below in conjunction with FIG. 3. Networking components 256 may for example, include various components to manage operations of the Open Systems Interconnection (OSI) network stack, including Internet Protocol (IP), TCP, UDP, SSL, HTTP, content encoding (content compression), and similar network related services. Networking components 256 may determine various network metrics, including, TCP maximum segment size (MSS), smoothed round trip time (RTT) for a connection, bandwidth delay product (BWDP), and the like. Networking components 256 may expose such network metrics to SCM 252. Smoothed round trip time includes round trip times that are sampled and smoothed over an interval to minimize impact of outliers, and/or possible aberrant readings from a packet drop, and the like.

Networking components 256 are further configured to retrieve various network metrics and to refine estimates of the network metrics for a given client device/gateway combination, where the gateway may include a router, gateway device, and the like (not shown) between TMD 106 and client device 102 of FIG. 1. Networking components 256 may store the refined estimated network metrics in routing metrics' store 254. Networking components 256 also may employ a process such as described below in conjunction with FIG. 5.

SCM 252 is configured to retrieve the estimated network metrics, and similar network connection characteristics, and to make an application decision based, in part, on them. SCM 252 may enable a user to configure various rules, decisions, events, conditions, and the like, as part of the application decisions. In one embodiment, enabling the user to configure a rule, and the like, may include employing an interpretative language, a complied network programming language, and the like. SCM 252 may enable a user to combine one or more network metrics, and the like, to make a decision to selectively compress content. In one embodiment, for example, the user may implement a rule, condition, and the like, that employs a predetermined threshold value for a network metric. Then, when the network metric falls below the predetermined threshold, the associated network connection may be considered to be a high latency, low-bandwidth link. SCM 252 may then compress content sent on towards the client device. SCM 252 may employ a process such as described in conjunction with FIG. 4 below.

Although illustrated in FIG. 2 as distinct components, networking components 256, routing metrics' store 254, and SCM 252 may be arranged, combined, and the like, in any of a variety of ways, without departing from the scope of the invention. For example networking components 256, routing metrics' store 254, and SCM 252 may be configured to operate a single component. Moreover, SCM 252 and/or routing metrics' store 254 may reside outside of operating system 220. Furthermore, while networking components 256, routing metrics' store 254, and SCM 252 are discussed as residing with TMD 106 of FIG. 1, the invention is not so limited. For example, one or more networking components 256, routing metrics' store 254, and SCM 252 may reside instead within at least one of servers 108-109.

As illustrated in FIG. 2, network device 200 also can communicate with the Internet, or some other communications network, such as network 104 in FIG. 1, via network interface unit 210, which is constructed for use with various communication protocols including the TCP/IP protocol. Network interface unit 210 is sometimes known as a transceiver, transceiving device, or network interface card (NIC).

The mass memory as described above illustrates another type of computer-readable media, namely computer storage media. Computer storage media may include volatile, nonvolatile, removable, and non-removable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, or other data. Examples of computer storage media include RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store the desired information and which can be accessed by a computing device.

The mass memory also stores program code and data. One or more applications 250 are loaded into mass memory and run on operating system 220. Examples of application programs may include email programs, routing programs, schedulers, calendars, database programs, word processing programs, HTTP programs, traffic management programs, security programs, and so forth.

Network device 200 may also include an SMTP handler application for transmitting and receiving e-mail, an IITTP handler application for receiving and handing HTTP requests, and an HTTPS handler application for handling secure connections. The HTTPS handler application may initiate communication with an external application in a secure fashion. Moreover, network device 200 may further include applications that support virtually any secure connection, including TLS, TTLS, EAP, SSL, IPSec, and the like. Similarly, network device 200 may include applications that support a variety of tunneling mechanisms, such as VPN, PPP, L2TP, and so forth.

Network device 200 may also include input/output interface 224 for communicating with external devices, such as a mouse, keyboard, scanner, or other input devices not shown in FIG. 2. Likewise, network device 200 may further include additional mass storage facilities such as CD-ROM/DVD-ROM drive 226 and hard disk drive 228. Hard disk drive 228 may be utilized to store, among other things, application programs, databases, and the like.

In one embodiment, the network device 200 includes at least one Application Specific Integrated Circuit (ASIC) chip (not shown) coupled to bus 222. The ASIC chip can include logic that performs some of the actions of network device 200. For example, in one embodiment, the ASIC chip can perform a number of packet processing functions for incoming and/or outgoing packets. In one embodiment, the ASIC chip can perform at least a portion of the logic to enable the operation of traffic manager 252 and/or pipeline manager 254.

In one embodiment, network device 200 can further include one or more field-programmable gate arrays (FPGA) (not shown), instead of, or in addition to, the ASIC chip. A number of functions of the network device can be performed by the ASIC chip, the FPGA, by CPU 212 with instructions stored in memory, or by any combination of the ASIC chip, FPGA, and CPU.

FIG. 3 shows one embodiment of routing metrics' store 300 useable in managing client network metrics. Although the invention is described below in terms of using a table, the invention is clearly not so limited and virtually any structure may be employed to store network metrics, including a spreadsheet, folder, database, text file, and the like.

Routing metrics' store 300 may include many more or less components than those shown. The components shown, however, are sufficient to disclose an illustrative embodiment for practicing the invention. As shown in the figure, routing metrics' store 300 includes client identifiers 302, gateway identifiers 304, and network metrics 306-308.

As illustrated in the figure, network metrics 306-308 may be associated with a given client/gateway pair (client identifier 302/gateway identifier 304). However, network metrics 306-308 may also be collected and stored based on client identifier 302 and not gateway identifier 304.

Client identifier 302 includes virtually any characters, numbers, and/or combination of characters and numbers that uniquely identify a client device that is in communication with the network device, such as TMD 106 of FIG. 1. Similarly, gateway identifier 304 is intended to uniquely identifier a router, gateway, firewall, and the like that couples the client device to TMD 106. For example, in one embodiment, client identifier 302, and gateway identifier 304 are IP addresses of the client device and the gateway device, respectively.

Network metric 307 represents smoothed round trip time (RTT) for a given client device/gateway combination. RTT typically is employed to determine how long to wait before a packet segment may be deemed to have been dropped during a network transmission and another packet is to be sent. That is, RTT is one metric that may be employed to handle network packet loses during a network communication. Additionally, the RTT metric may also be employed as a mechanism for estimating a client's data latency. For example, an RTT metric value around 500 msec may indicate a high latency client network connection, whereas a value around 10 msec may indicate a low-data latency client network connection. Because an RTT may require time to converge to a representative value of the network connection, network metric 307 may represent a smoothed estimate of RTT.

RTT may be determined in any of a variety of ways. For example, in one technique, an RTT estimate, which is included in the TCP protocol, may be determined based on a time delta between two or more TCP segments exchanged between communicating network endpoints. In another technique, one or more packets may be sent to a target device to elicit a response, and the time until a response packet is received from the target device, or an intermediary device, is measured. TCP packets, ICMP packets, and UDP packets, are examples of packets that can be used to obtain an RTT measurement. Thus, as used herein, determining an MT includes at least all of the above techniques, unless clearly limited otherwise.

A high latency client may not necessarily be a low-bandwidth network connection client, however. For example, a satellite link, trans-Atlantic link, and the like, may have high latencies, but may also be high bandwidth connections. Thus, additional metrics may also be collected.

One such other network metric includes maximum segment size (MSS), shown as network metric 306. Briefly, MSS typically represents a maximum TCP segment size that is not exceeded for a length of a data field in a TCP packet. Ethernet typically offers, for example, an MSS around 1500 bytes for the data field. A typical TCP/IP broadband connection may be between about 1148 to about 1460 bytes, with IPv6 connections typically being around 1440 bytes. MSS values in the 1300-1400 byte range may further indicate a use of a tunneling protocol, such as VPN, Point to Point Protocol over Ethernet (PPPoE), and the like. MSS values around 536 bytes tend to indicate the network connection may be a SLIP, PPP, or similar dial-up connection. Thus, the invention may collect and employ MSS metrics to further determine whether the network connection with a client device is a high-latency, low-bandwidth link.

Another network metric may include network metric 308 as a bandwidth delay product (BWDP). Briefly, BWDP may be employed to determine a maximum amount of data that may be placed on a network connection within a given time window. BWDP may be employed then to further determine a speed of a network connection.

Although only MSS, RTT, and BWDP are illustrated, the invention is not so limited, and other network metrics may be determined and stored in routing metrics' store 300 without departing from the scope or spirit of the invention.

Generalized Operation

The operation of certain aspects of the invention will now be described with respect to FIGS. 4-5. FIG. 4 illustrates a logical flow diagram generally showing one embodiment of a process for managing a communication with a client device using network metrics. Process 400 of FIG. 4 may be implemented, for example, within SCM 252 of FIG. 2.

Process 400 begins, after a start block, at block 402, which is described in more detail in conjunction with FIG. 5. Briefly however, at block 402 estimated network metrics are retrieved for a client connection. In one embodiment, the estimated network metrics include estimated values for RTT, MSS, and BWDP.

Processing continues next to block 404, where the retrieved network metrics are employed to determine a connection characteristic, such as whether the network connection is a high latency, low-bandwidth network connection. One or more of the retrieved network metrics are employed to make this determination. In one embodiment, a MSS value below one predefined threshold value is employed to determine if the client connection is likely to be a high latency, low-bandwidth network connection. Another threshold value may be employed to determine whether the BWDP indicates a high or low-bandwidth network connection. Similarly, RTT values below yet another predefined threshold may indicate a low-bandwidth network connection. These predefined threshold values may be determined based on a variety of criteria, including engineering judgment and experience. The results may then be weighted and/or combined to provide an overall estimate of the client network connection characteristic.

Process 400 next flows to decision block 406, where, based on the determination at block 404, a decision is made whether to perform an application layer operation. As shown in, process 400, one embodiment of the operation includes selectively compressing content communicated with the client device. Selective compression may be employed, for example, where the determination at block 404 indicates that the network connection is a high latency, low-bandwidth connection. If it is determined not to be a high latency, low-bandwidth connection, processing returns to a calling process to perform other actions.

Otherwise, processing flows to block 408, where content from the server may be compressed prior to sending the content to the client device. In one embodiment, content is compressed using the content encoding feature of HTTP. In another embodiment, block 408 may result in forwarding traffic to a predetermined server or servers dedicated to servicing low-bandwidth client devices. Upon completion of block 408, processing returns to a calling process to perform other actions.

FIG. 5 illustrates a logical flow diagram generally showing one embodiment of a process for determining client network metrics. Process 500 of FIG. 5 may be implemented, for example, within networking components 256 of FIG. 2.

Process 500 begins, after a start block, at block 502, where a client connection is established. A client connection may typically be considered to be established after a series of synchronization handshakes, such as a standard TCP SYN/SYN-ACK/ACK handshake, and the like.

Processing moves next to decision block 504, where a determination is made whether there are network metrics for this client/gateway combination. In one embodiment, a routing metrics' store is examined to determine whether such network metrics have been saved from a previous network connection for this client/gateway combination. If there are stored metrics, processing branches to block 514; otherwise, processing proceeds to block 506.

At block 506, because network metrics are not available for this client/gateway combination, the estimated network metrics are seeded with default values. The default values may be selected based on a variety of criteria, including selecting conservative values, non-zero values, and the like. In one embodiment, a default value for MSS may be selected based on an early value for the current client/gateway connection. That is, in one embodiment, the MSS may be supplied on an initial synchronization from the client. Moreover, the default values by include a best-guess approximation of the network metrics that may have been gathered on this connection so far (e.g., a smoothed variance of RTT, a moving averaged of BWDP, and collected initial MSS). Processing proceeds next to block 508.

At block 514, because network metrics are available for this client/gateway combination, they are retrieved. Processing continues to block 516, where the retrieved network metrics are used to seed the estimated network metrics. Processing continues to block 508.

At block 508, additional networking information is collected to further refine estimates of the network metrics. Refinement may include employing a variety of smoothing mechanisms to improve the estimates, including using a moving average approach. Processing flows next to decision block 510, where a determination is made whether the client connection is closed. If it is closed, processing flows to bock 512; otherwise, processing loops back to block 508, where further networking information is collected to continue to refine the estimates of the network metrics.

At block 512, however, when it is determined that the client connection is closed, the refined network metrics are used to update the routing metrics' store for this client/gateway connection combination. Processing then returns to a calling process to perform other actions.

FIG. 5 also illustrates one embodiment of at least three possible locations, E-1 through E-3, where process 400 may operate to retrieve network metrics. Thus, for example, at E-1, after the recognition of the client connection (block 502), initial networking connection metrics may be employed by the application layer to make a decision. Additionally, at E-2, default seed metrics may be employed by process 400. At E-3, process 400 may also enter to retrieve refined network metrics. However, the invention is not limited to these locations, and others may be used to retrieve the network metrics, without departing from the scope or spirit of the invention.

In one embodiment, system 100 includes two or more TMDs. The multiple TMDs may be collocated in a local area network, or geographically distributed. A first TMD having collected metric information from a client may send the metric information to the second TMD, for use in the above-described process. In one embodiment, a first TMD that employs redirection to cause the client to communicate with a second TMD, the second TMD may use two or more IP addresses, where the first TMD redirects the client to one of the IP addresses based on the latency or bandwidth of the client.

It will be understood that each block of the flowchart illustration, and combinations of blocks in the flowchart illustration, can be implemented by computer program instructions. These program instructions may be provided to a processor to produce a machine, such that the instructions, which execute on the processor, create means for implementing the actions specified in the flowchart block or blocks. The computer program instructions may be executed by a processor to cause a series of operational steps to be performed by the processor to produce a computer implemented process such that the instructions, which execute on the processor to provide steps for implementing the actions specified in the flowchart block or blocks.

Accordingly, blocks of the flowchart illustration support combinations of means for performing the specified actions, combinations of steps for performing the specified actions and program instruction means for performing the specified actions. It will also be understood that each block of the flowchart illustration, and combinations of blocks in the flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified actions or steps, or combinations of special purpose hardware and computer instructions.

The above specification, examples, and data provide a complete description of the manufacture and use of the composition of the invention. Since many embodiments of the invention can be made without departing from the spirit and scope of the invention, the invention resides in the claims hereinafter appended. 

1. A system, comprising: at least two or more servers for providing content over a network; and a first traffic management device (TMD) having memory and one or more central processing units to perform actions, including: determining a plurality of network metrics for communications through at least one client device/gateway combination for a client device; and sending the plurality of network metrics to a second TMD operating on another one or more central processing units, the second TMD performing actions, including: combining results of evaluations of each of the plurality of network metrics to a different threshold value into a single network connection characteristic for at least one client device/gateway combination; and based on whether the combined results indicate that a communications with the second TMD is a low-bandwidth connection, selectively providing compressed content to the client device through the associated gateway.
 2. The system of claim 1, wherein the determined plurality of different network metrics includes at least a maximum segment size (MSS) associated with network connections with the client device and each of a plurality of gateways.
 3. The system of claim 1, wherein the second TMD is configured to select from the servers to provide the compressed content based on one of the servers being configured to provide low bandwidth content for network connections having a low bandwidth network connection characteristic, and at least a second of the servers being configured to provide high bandwidth content for network connections having a high bandwidth network connection characteristic.
 4. The system of claim 1, wherein the first TMD is further configured to redirect the client device to communicate with the second TMD based on one of a latency or bandwidth characteristic of a communications with the client device.
 5. The system of claim 4, wherein the redirection is to a selected one of a plurality of Internet Protocol (IP) addresses associated with the second TMD, the selection of the IP address being based on the latency or bandwidth characteristic.
 6. The system of claim 1, wherein the second TMD is configured to selectively compress the content.
 7. The system of claim 1, wherein the servers are geographically located in different locations, and selectively providing the compressed content further comprises selecting one of the servers based on the server's geographic location and the combined results.
 8. An apparatus, comprising: a memory device for storing computer instructions; and at least one processor for executing the computer instructions to perform actions, including: receiving a plurality of network metrics associated with each network connection between a client device and a gateway combination for a plurality of different gateways; evaluating each of the plurality of network metrics to determine whether the respective network metric exceeds a threshold value for a given client device and gateway combination; combining a plurality of results from the evaluations to generate a single network connection characteristic for each of the client device and gateway combinations; and based on whether the combined results indicate that a network connection is a low-bandwidth connection for a given client device and gateway combination, selectively providing compressed content to the client device using the given client device and gateway combination.
 9. The apparatus of claim 8, further comprising: at least one or more other processors for executing computer instructions, including: seeding at least network metric about a network connection for at least one client device/gateway combination; continuously collecting additional networking information about the network connection while the network connection is open; employing the additional networking information to refine each of the plurality of network metrics about the network connection; and sending the refined plurality of estimated network metrics over a network to the at least one processor.
 10. The apparatus of claim 8, wherein selectively providing compressed content comprises compressing HTTP dynamic content at the at least one processor.
 11. The apparatus of claim 8, wherein selectively providing the compressed content further comprises selecting a server from a plurality of servers based on the server's geographic location and the combined results.
 12. The apparatus of claim 8, wherein selectively providing the compressed content further comprises selecting from one of a plurality of servers to provide the compressed content based on at least one of the plurality of servers being configured to provide low bandwidth high latency content for network connections having a low bandwidth high latency network connection characteristic, and at least a second of the plurality of servers being configured to provide high bandwidth content for network connections having at least a high bandwidth network connection characteristic.
 13. The apparatus of claim 8, wherein the network metrics includes at least two of a round trip time (RTT), maximum segment size (MSS), and a bandwidth delay product (BWDP).
 14. The apparatus of claim 8, wherein combining the results of evaluations further comprises: comparing at least two network metrics to different threshold values; obtaining a result for each of the comparisons; and combining the results of the comparisons to generate the single network connection characteristic.
 15. The apparatus of claim 8, wherein the single network connection characteristic further indicates whether the at least one client device/gateway combination is a high latency and low bandwidth network connection.
 16. A processor based method, the method comprising: receiving at a first processor, from a second processor, a plurality of network metrics for at least one client device and a gateway combination; combining results of evaluations of each of the plurality of network metrics to a different threshold value into a single network connection characteristic for the at least one client device/gateway combination; and based on whether the combined results indicate that the network connection is a low-bandwidth high latency connection, selectively providing compressed content to the client device.
 17. The processor based method of claim 16, wherein the selectively providing compressed content further comprises varying the compression algorithm or level of compression based on the combined results.
 18. The processor based method of claim 16, wherein the client device is redirected to a selected one of a plurality of Internet Protocol (IP) addresses associated with a third processor, the selected IP address being selected based in part on the combined results.
 19. The processor based method of claim 16, wherein selectively providing compressed content comprises compressing HTTP dynamic content at the first processor.
 20. The processor based method of claim 16, wherein selectively providing compressed content to the client device further comprises selecting a server from a plurality of servers from which to obtain the content based on the combined results and a geographic location of the servers. 